WHAT IS CLAIMED IS: 

1 1. A system for managing a plurality of distributed nodes of a network, 

2 comprising: 

3 a recovery module configured to migrate from one network node to another, 

4 determine a status of a network node, and initiate a recovery process on a failed 

5 network node. 

1 2. The system of claim 1, wherein the recovery module comprises a 

2 routing component for determining a next hop address from an origin network node 

3 to a destination network node. 

l 3. The system of claim 2, wherein the routing component is configured to 

O 2 determine the next hop address based upon a routing table stored at the origin 

fS 3 network node. 

«; l 4. The system of claim 1, wherein the recovery module is configured to 

•U 2 determine the status of a network node by sending an inter-process communication 

E I 5 ! 

g. 3 to a node process. 

D 

:?; l 5. The system of claim 1, wherein the recovery module is configured to 

Kl 2 determine the status of a network node in accordance with a heartbeat messaging 

''2 3 protocol. 

1 6. The system of claim 1, wherein the recovery module is configured to 

2 initiate a recovery process on a failed network node in accordance with a restart 

3 protocol. 

1 7. The system of claim 6, wherein the recovery module is configured to 

2 initiate a restart of a failed node process by transmitting a request to a process 

3 execution service operating on the failed network node. 

1 8. The system of claim 1, wherein the recovery module is configured to 

2 transmit a node status message to a network management module operating at a 

3 network management network node. 
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1 9. The system of claim 8, wherein the node status message comprises 

2 information obtained from a log file generated at the failed network node. 

1 10. The system of claim 1, further comprising a network management 

2 module configured to launch a plurality of recovery modules into the network. 

1 11. A method for managing a plurality of distributed nodes of a network, 

2 comprising: 

3 migrating from one network node to another; 

4 determining a status of a network node; and 

5 initiating a recovery process on a failed network node. 

1 12. The method of claim 11, wherein migrating from one network node to 

2 another comprises determining a next hop address from an origin network node to a 

3 destination network node. 

1 13. The method of claim 12, wherein the next hop address is determined 

2 based upon a routing table stored at the origin network node. 

1 14. The method of claim 11, wherein the status of a network node is 

2 determined by sending an inter-process communication to a node process. 

1 15. The method of claim 11, wherein the status of a network node is 

2 determined in accordance with a heartbeat messaging protocol. 

1 16. The method of claim 11, wherein a recovery process is initiated on a 

2 failed network node in accordance with a restart protocol. 

1 17. The method of claim 16, wherein a restart of a failed node process is 

2 initiated by transmitting a request to a process execution service operating on the 

3 failed network node. 

1 18. The method of claim 11, further comprising transmitting a node status 

2 message to a network management module operating at a network management 

3 network node. 
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1 19. The method of claim 11, further comprising launching into the network 

2 a plurality of recovery modules, each configured to migrate from one network node 

3 to another, determine the status of a network node, and initiate a recovery process 

4 on a failed network node. 

1 20. A computer program for managing a plurality of distributed nodes of a 

2 network, the computer program residing on a computer-readable medium and 

3 comprising computer-readable instructions for causing a computer to: 

4 migrate the computer program from one network node to another; 

5 determine a status of a network node; and 

6 initiate a recovery process on a failed network node. 
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